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Abstract 

A careful analysis of conditioning in the Sleeping Beauty problem is done, using the 
formal model for reasoning about knowledge and probability developed by Halpern and 
Tuttle. While the Sleeping Beauty problem has been viewed as revealing problems with 
conditioning in the presence of imperfect recall, the analysis done here reveals that the 
problems are not so much due to imperfect recall as to asynchrony. The implications of 
this analysis for van Fraassen's Reflection Principle and Savage's Sure-Thing Principle are 
considered. 



1 Introduction 



The standard approach to updating beliefs in the probability literature is by conditioning. But it 
turns out that conditioning is somewhat problematic if agents have imperfect recall. In the eco- 
nomics community this issue was brought to the fore by the work of Piccione and Rubinstein 
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N00014-01- 10-51 1, by the DoD Multidisciplinary University Research Initiative (MURI) program administered 
by the ONR under grant N00014-01-1-0795, and by AFOSR under grant F49620-02- 1-0101. A preliminary 
version of this paper appears in Principles of Knowledge Representation and Reasoning: Proceedings of the 
Ninth International Conference (KR 2004). 
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119971 (to which was dedicated a special issue of the journal Games and Economic Behav- 
ior). There has also been a recent surge of interest in the topic in the philosophy community, 
inspired by a re-examination by Elga H2000I of one of the problems considered by Piccione 
and Rubinstein, the so-called Sleeping Beauty problem} (Some recent work on the problem 
includes HArntzenius 2()()3UDorr 2()02"t ILewis 2()()lUMonton 20021 .) 

I take the Sleeping Beauty problem as my point of departure in this paper too. I argue 
that the problems in updating arise not just with imperfect recall, but also in asynchronous 
systems, where agents do not know exactly what time it is, or do not share a global clock. 
Since both human and computer agents are resource bounded and forgetful, imperfect recall is 
the norm, rather than an unusual special case. Moreover, there are many applications where it 
is unreasonable to assume the existence of a global clock. Thus, it is important to understand 
how to do updating in the presence of asynchrony and imperfect recall. 

The Sleeping Beauty problem is described by Elga as follows: 

Some researchers are going to put you to sleep. During the two days that your 
sleep will last, they will briefly wake you up either once or twice, depending on 
the toss of a fair coin (heads: once; tails: twice). After each waking, they will put 
you back to sleep with a drug that makes you forget that waking. When you are 
first awakened, to what degree ought you believe that the outcome of the coin toss 
is heads? 

Elga argues that there are two plausible answers. The first is that it is 1/2. After all, it was 
1/2 before you were put to sleep and you knew all along that you would be woken up (so you 
gain no useful information just by being woken up). Thus, it should still be 1/2 when you 
are actually woken up. The second is that it is 1/3. Clearly if this experiment is carried out 
repeatedly, then in the long run, at roughly one third of the times that you are woken up, you 
are in a trial in which the coin lands heads. 

Elga goes on to give another argument for 1/3, which he argues is in fact the correct answer. 
Suppose you are put to sleep on Sunday, so that you are first woken on Monday and then 
possibly again on Tuesday if the coin lands tails. Thus, when you are woken up, there are three 
events that you consider possible: 

• ei: it is now Monday and the coin landed heads; 

• e 2 : it is now Monday and the coin landed tails; 

• e^: it is now Tuesday and the coin landed tails. 

Elga's argument has two steps: 

1 . If, after waking up, you learn that it is Monday, you should consider e\ and e 2 equally 
likely. Since, conditional on learning that it is Monday, you consider e\ and e 2 equally 
likely, you should consider them equally likely unconditionally. 

1 So named by Robert Stalnaker. 
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2. Conditional on the coin landing tails, it also seems reasonable that e 2 and e 3 should be 
equally likely; after all, you have no reason to think Monday is any more or less likely 
that Tuesday if the coin landed tails. Thus, unconditionally, e 2 and e 3 should be equally 
likely. 

From these two steps, it follows that e\, e 2 , and e 3 are equally likely. The only way that this 
could happen is for them all to have probability 1/3. So heads should have probability 1/3. 

Suppose that the story is changed so that (1) heads has probability .99 and tails has proba- 
bility .01, (2) you are woken up once if the coin lands heads, and (3) you are woken up 9900 
times if the coin lands tails. In this case, Elga's argument would say that the probability of 
tails is .99. Thus, although you know you will be woken up whether the coin lands heads or 
tails, and you are initially almost certain that the coin will land heads, when you are woken up 
(according to Elga's analysis) you are almost certain that the coin landed tails! 

How reasonable is this argument? The second step involves an implicit appeal to the Prin- 
ciple of Indifference. But note that once e\ and e 2 are taken to be equally likely, the only way 
to get the probability of heads to be 1/2 is to give e 3 probability 0, which seems quite unrea- 
sonable. Thus, an appeal to the Principle of Indifference is not critical here to argue that 1/2 is 
not the appropriate answer. 

What about the first step? If your probability is represented by Pr then, by Bayes' Rule, 

Pr(heads \ Monday) 

Clearly Pr( Monday \ heads) = 1. By the Principle of Indifference, Pv(Monday \ tails) = 
1/2. If we take Pr(heads) = Pr (tails) = 1/2, then we get Pr(heads \ Monday) = 2/3. Intu- 
itively, it being Monday provides stronger evidence for heads than for tails, since Pr(Monday \ 
heads) is larger than Pr(Monday \ tails). Of course, this argument already assumes that 
Pr(heads) = 1/2, so we can't use it to argue that Pr(heads) = 1/2. The point here is simply 
that it is not blatantly obvious that Pr(heads \ Monday) should be taken to be 1/2. 2 

To analyze these arguments, I use a formal model for reasoning about knowledge and prob- 
ability that Mark Tuttle and I developed [Halpern and Tuttle 1993] (HT from now on), which 



Pr(Monday 


heads) Pr(heads) 


Pr(Monday 


heads) Pr(heads) + Pr(Monday 


tails) Pr (tails) 



in turn is based on the "multiagent systems" framework for reasoning about knowledge in com- 
puting systems, introduced in [Halpern and Fagin 1989| (see [Fagin, Halpern, Moses, and Vardi 1995 1 



for motivation and discussion). Using this model, I argue that Elga's argument is not as com- 
pelling as it may seem, although not for the reasons discussed above. The problem turns out 
to depend on the difference between the probability of heads conditional on it being Monday 
vs. the probability of heads conditional on learning that it is Monday. The analysis also reveals 
that, despite the focus of the economics community on imperfect recall, the real problem is 
not imperfect recall, but asynchrony: the fact that Sleeping Beauty does not know exactly what 
time it is. 



2 Thanks to Alan Hajek for making this point. 
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I then consider other arguments and desiderata traditionally used to justify probabilistic 
conditioning, such as frequency arguments, betting arguments, van Fraassen's 119841 Reflec- 
tion Principle, and Savage's H1954I Sure-Thing Principle. I show that our intuitions for these 
arguments are intimately bound up with assumptions such as synchrony and perfect recall. 

The rest of this paper is organized as follows. In the next section I review the basic mul- 
tiagent systems framework. In Section 3, 1 describe the HT approach to adding probability to 
the framework when the system is synchronous. HT generalized their approach to the asyn- 
chronous case; their generalization supports the "evidential argument" above, giving the an- 
swer 1/2 in the Sleeping Beauty problem. I also consider a second generalization, which gives 
the answer 1/3 in the Sleeping Beauty problem (although not exactly by Elga's reasoning). In 
SectionH I consider other arguments and desiderata. I conclude in Section[5] 



2 The framework 



2.1 The basic multiagent systems framework 



In this section, we briefly review the multiagent systems framework; see |Fagin, Halpe rn, Moses, and Vardi 1995 1 
for more details. 

A multiagent system consists of n agents interacting over time. At each point in time, each 
agent is in some local state. Intuitively, an agent's local state encapsulates all the information 
to which the agent has access. For example, in a poker game, a player's state might consist of 
the cards he currently holds, the bets made by the other players, any other cards he has seen, 
and any information he may have about the strategies of the other players (e.g., Bob may know 
that Alice likes to bluff, while Charlie tends to bet conservatively). In the Sleeping Beauty 
problem, we can assume that the agent has local states corresponding to "just woken up", "just 
before the experiment", and "just after the experiment". 

Besides the agents, it is also conceptually useful to have an "environment" (or "nature") 
whose state can be thought of as encoding everything relevant to the description of the system 
that may not be included in the agents' local states. For example, in the Sleeping Beauty 
problem, the environment state can encode the actual day of the week and the outcome of the 
coin toss. In many ways, the environment can be viewed as just another agent. In fact, in the 
case of the Sleeping Beauty problem, the environment can be viewed as the local state of the 
experimenter. 

We can view the whole system as being in some global state, a tuple consisting of the 
local state of each agent and the state of the environment. Thus, a global state has the form 
(s e , si, . . . , s n ), where s e is the state of the environment and Sj is agent i's state, for i = 
1, . . . ,n. 

A global state describes the system at a given point in time. But a system is not a static 
entity. It is constantly changing over time. A run captures the dynamic aspects of a system. 
Intuitively, a run is a complete description of one possible way in which the system's state can 
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evolve over time. Formally, a run is a function from time to global states. For definiteness, I 
take time to range over the natural numbers. Thus, r(0) describes the initial global state of the 
system in a possible execution, r(l) describes the next global state, and so on. A pair (r, m) 
consisting of a run r and time m is called a point. If r(m) = (s e , s\, . . . , s n ), then define 
r e (m) = s e and r;(m) = Sj, z = 1, . . . , n; thus, rj(m) is agent i's local state at the point (r, m) 
and r e (m) is the environment's state at (r, m). I write (r, m) ~j (r', m') if agent z has the same 
local state at both (r, m) and (r',m'), that is, if rj(m) = r-(m'). Let /Q(r, m) = {(r', m') : 
(r, m) ~j (r', m')}. Intuitively, /Q(r, m) is the set of points that i considers possible at (r, m); 
these are the states that i cannot distinguish based basis of i's information at (r, m). Sets of the 
form /Q(r, m) are sometimes called information sets. 

In general, there are many possible executions of a system: there could be a number of 
possible initial states and many things that could happen from each initial state. For example, 
in a draw poker game, the initial global states could describe the possible deals of the hand by 
having player i's local state describe the cards held by player i. For each fixed deal of the cards, 
there may still be many possible betting sequences, and thus many runs. Formally, a system 
is a nonempty set of runs. Intuitively, these runs describe all the possible sequences of events 
that could occur in the system. Thus, I am essentially identifying a system with its possible 
behaviors. 

There are a number of ways of modeling the Sleeping Beauty problem as a system. Perhaps 
simplest is to consider it as a single-agent problem, since the experimenter plays no real role. 
(Note that it is important to have the environment though.) Assume for now that the system 
modeling the Sleeping Beauty problem consists of two runs, the first corresponding to the coin 
landing heads, and the second corresponding to the coin landing tails. (As we shall see, while 
restricting to two runs seems reasonable, it may not capture all aspects of the problem.) There 
are still some choices to be made with regard to modeling the global states. Here is one way: 
At time 0, a coin is tossed; the environment state encodes the outcome. At time 1, the agent is 
asleep (and thus is in a "sleeping" state). At time 2, the agent is woken up. If the coin lands 
tails, then at time 3, the agent is back asleep, and at time 4, is woken up again. Note that I have 
assumed here that time in both of these runs ranges from to 5. Nothing would change if I 
allowed runs to have infinite length or a different (but sufficiently long) finite length. 

Alternatively, we might decide that it is not important to model the time that the agent is 
sleeping; all that matters is the point just before the agent is put to sleep and the points where the 
agent is awake. Assume that Sleeping Beauty is in state b before the experiment starts, in state 
a after the experiment is over, and in state w when woken up. This leads to a model with two 
runs ri and r 2 , where the first three global states in r\ are (H, b), (H, w), and (if, a), and the 
first four global states in r 2 are (T, b), (T, w), (T, w), (T, a). Let IZi be the system consisting 
of the runs r x and r 2 . This system is shown in Figured] (where only the first three global states 
in each run are shown). The three points where the agent's local state is w, namely, (r 1; 1), 
(r 2 , 1), and (r 2 , 2), form what is traditionally called in game theory an information set. These 
are the three points that the agent considers possible when she is woken up. For definiteness, I 
use in much of my analysis of Sleeping Beauty. 
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Notice that 1Z\ is also compatible with a somewhat different story. Suppose that the agent 
is not aware of time passing. At time the coin is tossed, and the agent knows this. If the coin 
lands heads, only one round passes before the agent is told that the experiment is over; if the 
coin lands tails, she is told after two rounds. Since the agent is not aware of time passing, her 
local state is the same at the points (n, 2), (r 2 , 1), and (r 2 , 2). The same analysis should apply 
to the question of what the probability of heads is at the information set. The key point is that 
here the agent does not forget; she is simply unaware of the time passing. 

Various other models are possible: 

• We could assume (as Elga does at one point) that the coin toss happens only after the 
agent is woken up the first time. Very little would change, except that the environment 
state would be (or some other way of denoting that the coin hasn't been tossed) in the 
first two global states of both runs. Call the two resulting runs r[ and r 2 . 

• All this assumes that the agent knows when the coin is going to be tossed. If the 
agent doesn't know this, then we can consider the system consisting of the four runs 

r 1 ,r' 1 ,r 2 ,r' 2 . 

• Suppose that we now want to allow for the possibility that, upon wakening, the agent 
learns that it is Monday (as in Elga's argument). To do this, the system must include 
runs where the agent actually learns that it is Monday. Now two runs no longer suffice. 
For example, we can consider the system 1Z 2 = (r±, r 2 , r{, r 2 ), where r* is the same as 
rj except that on Monday, the agent's local state encodes that it is Monday. Thus, the 
sequence of global states in r* is (H, b), (H, M), (if, a), and the sequence in r 2 is (T, b), 
(T, M), (T, w). 1Z 2 is described in FigureEl Note that on Tuesday in r 2 , the agent forgets 
whether she was woken up on Monday. She is in the same local state on Tuesday in r 2 
as she is on both Monday and Tuesday in r 2 . 

Yet other representations of the Sleeping Beauty problem are also possible. The point that I 
want to emphasize here is that the framework has the resources to capture important distinctions 
about when the coin is tossed and what agents know. 
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Figure 2: An alternate representation of the Sleeping Beauty problem, using 7^2- 



2.2 Synchrony and perfect recall 

One advantage of the multiagent systems framework is that it can be used to easily model 
a number of important assumptions. I focus on two of them here: synchrony, the assump- 
tion that agents know the time, and perfect recall, the assumption that agents do not forget 
[ |Fagin, Halpern, Moses, and Vardi 199"5l|Halpern and Vardi 1989| 

Formally, a system 1Z is synchronous for agent i if for all points (r, m) and (r', m!) in 1Z, 
if (r, m) ~j (r', mf), then m = m'. Thus, if 1Z is synchronous for agent i, then at time m, 
agent i knows that it is time m, because it is time m at all the points he considers possible. 1Z is 
synchronous if it is synchronous for all agents. Note that the systems that model the Sleeping 
Beauty problem are not synchronous. When Sleeping Beauty is woken up on Monday, she 
does not know what day it is. 

Consider the following example of a synchronous system, taken from [ Halpern 2003 1 : 



Example 2.1: Suppose that Alice tosses two coins and sees how the coins land. Bob learns 
how the first coin landed after the second coin is tossed, but does not learn the outcome of the 
second coin toss. How should this be represented as a multiagent system? The first step is to 
decide what the local states look like. There is no "right" way of modeling the local states. 
What I am about to describe is one reasonable way of doing it, but clearly there are others. 

The environment state will be used to model what actually happens. At time 0, it is ( ), the 
empty sequence, indicating that nothing has yet happened. At time 1, it is either (H) or (T), 
depending on the outcome of the first coin toss. At time 2, it is either (H, H), (H, T), (T, H), 
or (T, T), depending on the outcome of both coin tosses. Note that the environment state is 
characterized by the values of two random variables, describing the outcome of each coin toss. 
Since Alice knows the outcome of the coin tosses, I take Alice's local state to be the same as 
the environment state at all times. 

What about Bob's local state? After the first coin is tossed, Bob still knows nothing; he 
learns the outcome of the first coin toss after the second coin is tossed. The first thought might 
then be to take his local states to have the form ( ) at time and time 1 (since he does not know 
the outcome of the first coin toss at time 1) and either (H) or (T) at time 2. This choice would 
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not make the system synchronous, since Bob would not be able to distinguish time from time 
1. If Bob is aware of the passage of time, then at time 1, Bob's state must somehow encode the 
fact that the time is 1. I do this by taking Bob's state at time 1 to be (tick), to denote that one 
time tick has passed. (Other ways of encoding the time are, of course, also possible.) Note that 
the time is already implicitly encoded in Alice's state: the time is 1 if and only if her state is 
either (H) or (T). 

Under this representation of global states, there are seven possible global states: 

• ((),(), ()), the initial state, 

• two time-1 states of the form {(Xi), (Xt), (tick)), for X\ E {H, T}, 

• four time-2 states of the form ((X U X 2 ), (X U X 2 ), (tick, X ± )), for X u X 2 G {H, T}. 

In this simple case, the environment state determines the global state (and is identical to Alice's 
state), but this is not always so. 

The system describing this situation has four runs, r 1 , . . . , r 4 , one for each of the time-2 
global states. The runs are perhaps best thought of as being the branches of the computation 
tree described in Figure 




Figure 3: Tossing two coins. 



I 

Modeling perfect recall in the systems framework requires a little care. In this framework, 
an agent's knowledge is determined by his local state. Intuitively, an agent has perfect recall if 
his local state is always "growing", by adding the new information he acquires over time. This 
is essentially how the local states were modeled in Example l2.ll In general, local states are not 
required to grow in this sense, quite intentionally. It is quite possible that information encoded 
in rj(m) — z's local state at time m in run r — no longer appears in rj(m + 1). Intuitively, 
this means that agent i has lost or "forgotten" this information. There are often scenarios of 
interest where it is important to model the fact that certain information is discarded. In practice, 
for example, an agent may simply not have enough memory capacity to remember everything 
he has learned. Nevertheless, although perfect recall is a strong assumption, there are many 
instances where it is natural to model agents as if they do not forget. 
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Intuitively, an agent with perfect recall should be able to reconstruct his complete local 
history from his current local state. To capture this intuition, let agent i 's local-state sequence 
at the point (r, m) be the sequence of local states that she has gone through in run r up to 
time m, without consecutive repetitions. Thus, if from time through time 4 in run r agent i 
has gone through the sequence (sj, Sj, s[, Si, Sj) of local states, where Si ^ s[, then her local- 
state sequence at (r, 4) is (s i; s-, Sj). Agent i's local-state sequence at a point (r, m) essentially 
describes what has happened in the run up to time m, from i's point of view. Omitting consec- 
utive repetitions is intended to capture situations where the agent has perfect recall but is not 
aware of time passing, so she cannot distinguish a run where she stays in a given state s for 
three rounds from one where she stays in s for only one round. 

An agent has perfect recall if her current local state encodes her whole local-state sequence. 
More formally, agent i has perfect recall in system TZ if, at all points (r, m) and (r 1 , m!) in TZ, 
if (r, m) ~j (r', m'), then agent i has the same local-state sequence at both (r, m) and (r 1 , ml). 
Thus, agent i has perfect recall if she "remembers" her local-state sequence at all times. 3 In a 
system with perfect recall, r^m) encodes i's local-state sequence in that, at all points where z's 
local state is r,j(m), she has the same local-state sequence. A system where agent % has perfect 
recall is shown in Figure HI 




Figure 4: An asynchronous system where agent % has perfect recall. 



The combination of synchrony and perfect recall leads to particularly pleasant properties. It 
is easy to see that if TZ is asynchronous system with perfect recall and (r, m + 1) ~j (r',m + l), 
then (r, m) ~j (r 1 , m). That is, if agent i considers run r' possible at the point (r, m + 1), then 
i must also consider run r' possible at the point (r, m). (Proof: since the system is synchronous 
and i has perfect recall, i's local state must be different at each point in r. For if i's local state 
were the same at two points (r, k) and (r, k') for k ^ k', then agent i would not know that it 
was time k at the point (r, k). Thus, at the points (r, m + 1), i's local-state sequence must have 
length m+1. Since (r, m+1) ~j (r', m + 1), i has the same local-state sequence at (r, m+1) 

3 This definition of perfect recall is not quite the same as that used in the game theory literature, where agents 
must explicitly recall the actions taken (see |Halpern 1997| for a discussion of the issues), but the difference 
between the two notions is not relevant here. In particular, according to both definitions, the agent has perfect 
recall in the "game" described by Figure^ 
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and (r 1 , m + 1). Thus, i must also have the same local-state sequence at the points (r, m) and 
(r', m), since i's local-state sequence at these points is just the prefix of i's local-state sequence 
at (r, m + 1) of length m. It is then immediate that (r, m) ~j (r', m).) Thus, in a synchronous 
system with perfect recall, agent i's information set refines over time, as shown in Figure|5] 4 



Figure 5: A synchronous system with perfect recall. 



Note that whether the agent has perfect recall in the Sleeping Beauty problem depends in 
part on how we model the problem. In the system Tl\ she does; in 1Z 2 she does not. For 
example, at the point [r^, 2) in 1Z 2 , where her local state is (T, w), she has forgotten that she 
was woken up at time 1 (because she cannot distinguish (r 2 , 2) from (r 2 , 2)). (It may seem 
strange that the agent has perfect recall in TZi, but that is because in TZi, the time that the agent 
is asleep is not actually modeled. It happens "between the points". If we explicitly include local 
states where the agent is asleep, then the agent would not have perfect recall in the resulting 
model. The second interpretation of TZi, where the agent is unaware of time passing, is perhaps 
more compatible with perfect recall. I use 7?-i here so as to stress that perfect recall is not really 
the issue in the Sleeping Beauty problem; it is the asynchrony.) 



3 Adding probability 

To add probability to the framework, I start by assuming a probability on the set of runs in 
a system. Intuitively, this should be thought of as the agents' common probability. It is not 
necessary to assume that the agents all have the same probability on runs; different agents may 
have use probability measures. Moreover, it is not necessary to assume that the probability is 
placed on the whole set of runs. There are many cases where it is convenient to partition the set 

4 In the language of probabilists, in synchronous systems with perfect recall, information sets form a filtration 
|Billingsley 1986| Section 35]. The importance of assuming that the information sets form a filtration in the 
context of the Sleeping Beauty problem is emphasized by Schervish, Seidenfeld, and Kadane 1200411 . However, 
my analysis applies in the asynchronous case applies despite the fact that the information sets do not form a 
filtration. 
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of runs and put a separate probability measure on each cell in the partition (see [ |Halpern 2003| 
for a discussion of these issues). However, to analyze the Sleeping Beauty problem, it suffices 
to have a single probability on the runs. A probabilistic system is a pair (1Z, Pr), where 1Z is a 
system (a set of runs) and Pr is a probability on 1Z. (For simplicity, I assume that 1Z is finite 
and that all subsets of 1Z are measurable.) In the case of the Sleeping Beauty problem, the 
probability on TZi is immediate from the description of the problem: each of T\ and r 2 should 
get probability 1/2. To determine a probability on the runs of 1Z 2 , we need to decide how likely 
it is that the agent will discover that it is actually Monday. Suppose that probability is a. In 
that case, r\ and r 2 both get probability (1 — a)/2, while r\ and r 2 both get probability a/2. 

Unfortunately, the probability on runs is not enough for the agent to answer questions like 
"What is the probability that heads was tossed?" if she is asked this question at the point 
(7*1, 1) when she is woken up in TZi, for example. At this point she considers three points 
possible: (r 1; 1), (r 2 , 1), and (r 2 , 2), the three points where she is woken up. She needs to put 
a probability on this space of three points to answer the question. Obviously, the probability 
on the points should be related to the probability on runs. But how? That is the topic of this 
section. 

As the preceding discussion should make clear, points can be viewed possible worlds. In 
HT, a modal logics of knowledge and probability is considered where truth is defined relative 
to points in a system. Points are somewhat analogous to what Lewis HI 9791 calls centered 
possible worlds, since they are equipped with a time (although they are not equipped with a 
designated individual). Runs can then be viewed as uncentered possible worlds. Lewis 119791 
argued that credence should be placed not on possible worlds, but on centered possible worlds. 
The key issue here is that in many applications, it is more natural to start with a probability on 
uncentered worlds; the question is how to define a probability on centered worlds. 5 

3.1 The synchronous case 

Tuttle and I suggested a relatively straightforward way of going from a probability on runs 
to a probability on points in synchronous systems. For all times m, the probability Pr on 7Z, 
the set of runs, can be used to put a probability Pr m on the points in 1Z m = {(r, m) : r G 
1Z}\ simply take Pr m (r, m) = Pr(r). Thus, the probability of the point (r, m) is just the 
probability of the run r. Clearly, Pr m is a well-defined probability on the set of time-m points. 
Since 1Z is synchronous, at the point (r, m), agent i considers possible only time-m points. 
That is, all the points in /Q(r, m) = {(r', m') : (r, m) ~, (r', m')} are actually time-m points. 
Since, at the point (r, m), the agent considers possible only the points in /Q(r, m), it seems 
reasonable to take the agent's probability at the point (r, m) to the result of conditioning Pr m 

5 As a cultural matter, in the computer science literature, defining truth/credence relative to centered worlds is 
the norm. Computer scientists are, for example, interested in temporal logic for reasoning about what happens 
while a program is running IMa nna and Pnueli 19921 . Making time part of the world is necessary for this rea- 
soning. Interestingly, economists, like philosophers, have tended to focus on uncentered worlds. I have argued 
elsewhere |Halpern 1997| that centered worlds (represented as points) are necessary to capture some important 
temporal considerations in the analysis of games. 
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on /Cj(r, m), provided that Pr m (/Q(r, m)) > 0, which, for simplicity, I assume here. Taking 
Pr(r,m,i) to denote agent i's probability at the point (r, m), this suggests that Pr( r m j)(r', m) = 
Pr m ((V,m) | Ki(r,m)). 

To see how this works, consider the system of Example l2.1[ Suppose that the first coin has 
bias 2/3, the second coin is fair, and the coin tosses are independent, as shown in Figure 
Note that, in Figure the edges coming out of each node are labeled with a probability, which 
is intuitively the probability of taking that transition. Of course, the probabilities labeling the 
edges coming out of any fixed node must sum to 1, since some transition must be taken. For 
example, the edges coming out of the root have probability 2/3 and 1/3. Since the transitions 
in this case (i.e., the coin tosses) are assumed to be independent, it is easy to compute the 
probability of each run. For example, the probability of run r 1 is 2/3 x 1/2 = 1/3; this 
represents the probability of getting two heads. 

- / \ - 

3/ N3 



1 


1 / 


\ 1 
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2 / 


\2 








j -.2 




/' 1 



Figure 6: Tossing two coins, with probabilities. 



3.2 The general case 

The question now is how the agents should ascribe probabilities in arbitrary (not necessarily 
synchronous) system, such as that of the Sleeping Beauty problem. The approach suggested 
above does not immediately extend to the asynchronous case. In the asynchronous case, the 
points in /Q(r, m) are not in general all time-m points, so it does not make sense to condi- 
tion Pr m on /Q(r, m). (Of course, it would be possible to condition on the time-m points 
in /Q(r, m), but it is easy to give examples showing that doing this gives rather nonintuitive 
results.) 

I discuss two reasonable candidates for ascribing probability in the asynchronous case here, 
which are generalizations of the two approaches that Elga considers. I first consider these ap- 
proaches in the context of the Sleeping Beauty problem, and then give the general formaliza- 
tion. 

Consider the system described in Figure UJ but now suppose that the probability of r x is j3 
and the probability of r 2 is 1 — /?. (In the original Sleeping Beauty problem, /3 — 1/2.) It seems 
reasonable that at the points (n, 0) and (r 2 , 0), the agent ascribes probability (3 to (n, 0) and 1— 
P to (r 2 , 0), using the HT approach for the synchronous case. What about at each of the points 
(7*1, 1), (r 2 , 1), and (r 2 , 2)? One approach (which I henceforth call the HT approach, since it 
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was advocated in HT), is to say that the probability ft of run r\ is projected to the point (r 1; 1), 
while the probability I— ft of r 2 is projected to (r 2 , 1) and (r 2 , 2). How should the probability be 
split over these two points? Note that splitting the probability essentially amounts to deciding 
the relative probability of being at time 1 and time 2. Nothing in the problem description gives 
us any indication of how to determine this. HT avoid making this determination by making 
the singleton sets {(r 2 , 1}} and {(r 2 , 2)} nonmeasurable. Since they are not in the domain of 
the probability measure, there is no need to give them a probability. The only measurable sets 
in this space would then be 0, {(r 1; 1)}, {(r 2 , 1), (r 2 , 2)}, and {(n, 1), (r 2 , 1), (r 2 , 2)}, which 
get probability 0, ft, 1 — ft, and 1, respectively. An alternative is to apply the Principle of 
Indifference and take times 1 and 2 to be equally likely. In this case the probability of the set 
{((r 2 , 1), (r 2 , 2)} is split over (r 2 , 1) and (r 2 , 2), and they each get probability (l — ft)/2. When 
ft = 1/2, this gives Elga's first solution. Although it is reasonable to assume that times 1 and 2 
are equally likely, the technical results that I prove hold no matter how the probability is split 
between times 1 and 2. 

The second approach, which I call the Elga approach (since it turns out to generalize what 
Elga does), is to require that for any pair of points (r, m) and (r 1 , m') on different runs, the 
relative probability of these points is the same as the relative probability of r and r'. This 
property is easily seen to hold for the HT approach in the synchronous case. With this approach, 
the ratio of the probability of (ri, 1) and (r 2 , 1) is ft : 1 — ft, as is the ratio of the probability of 
(ri, 1) and (r 2 , 2). This forces the probability of (r±, 1) to be ft/ (2 — ft), and the probability of 
each of (r l7 1) and (r 2 , 2) to be (1 — ft)/ (2 — ft). Note that, according to the Elga approach, if 
Pr is the probability on the runs of TZi, ft = 1/2, so that Pr(rx) = Pr(r 2 ) = 1/2, and Pr' is the 
probability that the agent assigns to the three points in the information set, then 

Pr'((n,l) I {(ri,l),(r 2 ,l)}) 
= Pr'((n,l) I {(ri,l),(r 2 ,2)}) 
= Pr(n I {n,r 2 }) 
= 1/2. 

Thus, we must have Pr'((ri, 1)) = Pr'((r 2 , 1)) = Pr'((r 2 , 2)), so each of the three points has 
probability 1/3, which is Elga's second solution. Moreover, note that 

Pr'((r 1; 1) I {(n, 1), (r 2 , 1)}) = Pr'((r 2 , 1) | {(n, 1), (r 2 , 2)}) = 1/2. 

This is one way of formalizing the first step of Elga's argument; i.e., that Pr' should have the 
property that, conditional on learning it is Monday, you should consider "it is now Monday 
and the coin landed heads" and "it is now Monday and the coin landed tails" equally likely. 
The second step of Elga's argument used the Principle of Indifference to conclude that, if the 
coin landed tails, then all days were equally likely. That use of the Principle of Indifference is 
implicit in the assumption that the relative probability of (r±, m) and (r 2 , m) is the same for 
m = 1 and m = 2. 

To summarize, the HT approach assigns probability among points in an information set / 
by dividing the probability of a run r among the points in / that lie on r (and then normalizing 
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so that the sum is one), while the Elga approach proceeds by giving each and every point in I 
that is on run r the same probability as that of r, and then normalizing. 

For future reference, I now give a somewhat more precise formalization of the HT and Elga 
approaches. To do so, it is helpful to have some notation that relates sets of runs to sets of 
points. If S is a set of runs and U is a set of points, let S(U) be the set of runs in S going 
through some point in U . and let U (S) be the set of points in U that lie on some run in S. That 

is, 

= {r G S : (r, m) G U for some m} and 
U(S) = {(r,m) eU :reS}. 

Note that, in particular, /Q(r, m){r') is the set of points in the information set /Q(r, m) that 
are on the run r' and lZ(K,i(r,m)) is the set of runs in the system 1Z that contain points in 
/Cj(r, m). According to the HT approach, if Pr^ is agent z's probability on 1Z, the set of runs, 
then Prf r T m i) (lCi(r, m)(r')) = Pr^r' | lZ(JCi(r,m))). (Note that here I am using Pr^J m) to 
denote agent i's probability at the point (r, m) calculated using the HT approach; I similarly 
will use Prf^m) to denote agent i's probability calculated using the Elga approach.) That is, 
the probability that agent % assigns at the point (r, m) to the points in r' is just the probability of 
the run r' conditional on the probability of the runs going through the information set /Q(r, m). 
As I said earlier, Halpern and Tuttle do not try to assign a probability to individual points in 
}Ci(r, m) (r') if there is more than one point on r' in /Q(r, m). 

By way of contrast, the Elga approach is defined as follows: 

{W ' ] E P /^( M r I m))Pri(r l O|/C i (r l m)({r"})r 

It is easy to check that Prf.^^ is the unique probability measure Pr' on /Q(r, m) such that 

Pr / ((r 1 ,m 1 ))/Pr / ((r 2 ,m 2 )) = Pr^r^/Pr^ra) if Pri(r 2 ) > 0. Note that Pr^. } assigns 

equal probability to all points on a run r' in /Cj(r, m). Even if Pr^^ ^ is extended so that 

all points on a given run are taken to be equally likely, in general, Pr^ ^ ^ Pr^'mi)- The 
following lemma characterizes exactly when the approaches give identical results. 

Lemma 3.1: Pr^ = Pr^ #|/Q(r, m)({n})| = \1d(r, m)({r 2 })| for all runs n,r 2 G 
TZ(JCi(r,m)) such that Pr j(ry) 7^ Oforj = 1,2. 

Note that, in the synchronous case, |/Q(r, = 1 for all runs r' G 1Z(K,i(r, m)), so 

the two approaches are guaranteed to give the same answers. 



4 Comparing the Approaches 

I have formalized two approaches for ascribing probability in asynchronous settings, both of 
which generalize the relatively noncontroversial approach used in the synchronous case. Which 
is the most appropriate? I examine a number of arguments here. 
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4.1 Elga's Argument 



Elga argued for the Elga approach, using the argument that if you discover or learn that it is 
Monday, then you should consider heads and tails equally likely. As I suggested above, I do 
not find this a compelling argument for the Elga approach. I agree that if you learn that it 
is Monday, you should consider heads and tails equally likely. On the other hand, Sleeping 
Beauty does not actually learn that it is Monday. Elga is identifying the probability of heads 
conditional on learning that it is Monday with the probability of heads given that it is Monday. 
While these probabilities could be equal, they certainly do not have to be. An example of 
Thomason makes the point nicely: 6 If I think my wife is much more clever than I, then I 
might be convinced that I will never learn of her infidelity should she be unfaithful. So, my 
conditional probability for Y, "I will learn that my wife is cheating on me", given X,"She will 
cheat on me", is very low. Yet, the probability of Y if I actually learn X is clearly 1 ? 

In any case, in asynchronous systems, the two probabilities may be unequal for reasons 
beyond those that arise in the synchronous case. This is perhaps best seen by considering 
a system where the agent might actually learn that it is Monday. The system 1Z 2 described 
Figure|2]is one such system. Note that in 1Z 2 , even if the HT approach is used, if you discover 
it is Monday in run r\ or r 2 , then you do indeed ascribe probability 1/2 to heads. On the 
other hand, in n and r 2 , where you do not discover it is Monday, you also ascribe probability 
1/2 to heads when you are woken up, but conditional on it being Monday, you consider the 
probability of heads to be 2/3. Thus, using the HT approach, 1Z 2 gives an example of a system 
where the probability of heads given that it is Monday is different from the probability of heads 
conditional on learning that it is Monday. 

Although 1Z 2 shows that Elga's argument for the 1/3-2/3 answer is suspect, it does not fol- 
low that 1/3-2/3 is incorrect. In the remainder of this section, I examine other considerations 
to see if they shed light on what should be the appropriate answer. 

4.2 The Frequency Interpretation 

One standard interpretation of probability is in terms of frequency. If the probability of a coin 
landing heads is 1/2, then if we repeatedly toss the coin, it will land heads in roughly half the 
trials; it will also land heads roughly half the time. In the synchronous case, "half the trials" 
and "half the time" are the same. But now consider the Sleeping Beauty problem. What counts 
as a "trial"? If a "trial" is an experiment, then the coin clearly lands heads in half of the trials. 
But it is equally clear that the coin lands heads 1/3 of the times that the agent is woken up. 
Considering "times" and "trials" leads to different answers in asynchronous systems; in the 

6 Thanks to Jim Joyce for pointing out this example. 

7 There are other reasons why the probability of Y given X might be different from the probability of Y 
given that you learn or observe X. In the latter case, you must take into account how you came to learn that X 
is the case. Without taking this into account, you run into difficulties with, say, the Monty Hall problem. See 
|Griinwald and Halpern 200 3] for a discussion of this point in the synchronous setting. I ignore this issue here, 
since it is orthogonal to the issues that arise in the Sleeping Beauty problem. 
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case of the Sleeping Beauty problem, these different answers are precisely the natural 1/2-1/2 
and 1/3-2/3 answers. I return to this issue in the next subsection. 

4.3 Betting Games 

Another standard approach to determining subjective probability, which goes back to Ram- 
sey H1931I and De Finetti 119311 . is in terms of betting behavior. For example, one way of 
determining the subjective probability that an agent ascribes to a coin toss landing heads is 
to compare the odds at which he would accept a bet on heads to one at which he would ac- 
cept a bet on tails. While this seems quite straightforward, in the asynchronous case it is not. 
This issue was considered in detail in the context of the absented-minded driver paradox in 
| Grove and Halpern 1997[ . Much the same comments hold here, so I just do a brief review. 

Suppose that Sleeping Beauty is offered a $1 bet on whether the coin landed heads or the 
coin landed tails every time she is woken up. If the bet pays off every time she answers the 
question correctly, then clearly she should say "tails". Her expected gain by always saying tails 
is $1 (since, with probability 1/2, the coin will land tails and she will get $1 both times she 
is asked), while her expected gain by always saying heads is only 1/2. Indeed, a risk-neutral 
agent should be willing to pay to take this bet. Thus, even though she considers heads and tails 
equally likely and ascribes probabilities using the HT approach, this betting game would have 
her act as if she considered tails twice as likely as heads: she would be indifferent between 
saying "heads" and "tails" only if the payoff for heads was $2, twice the payoff for tails. 

In this betting game, the payoff occurs at every time step. Now consider a second betting 
game, where the payoff is only once per trial (so that if the coin lands tails, the agent get $1 if 
she says tails both times, and $0.50 if she says tails only once). If the payoff is per trial, then 
the agent should be indifferent being saying "heads" and "tails"; the situation is analogous to 
the discussion in the frequency interpretation. 

There is yet a third alternative. The agent could be offered a bet at only one point in the 
information set. If the coin lands heads, she must be offered the bet at (r 1; 1). If the coin 
lands heads, an adversary must somehow choose if the bet will be offered at (r 2 , 1) or (r 2 , 2). 
The third betting game is perhaps more in keeping with the second story told for TZ 1 , where 
the agent is not aware of time passing and must assign a probability to heads and tails in the 
information set. It may seem that the first betting game, where the payoff occurs at each step, 
is more appropriate to the Sleeping Beauty problem — after all, the agent is woken up twice if 
the coin lands tails. Of course, if the goal of the problem is to maximize the expected number 
of correct answers (which is what this betting game amounts to), then there is no question that 
"tails" is the right thing to say. On the other hand, if the goal is to get the right answer "now", 
whenever now is, perhaps because this is the only time that the bet will be offered, then the 
third game is more appropriate. My main point here is that the question of the right betting 
game, while noncontroversial in the synchronous case, is less clear in the asynchronous case. 

It is interesting to see how these issues play out in the context of Hitchcock's H2004I Dutch 
Book analysis of the Sleeping Beauty problem. As Hitchcock points out, there is a collection 
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of bets that form a Dutch book, which can be offered by a Bookie who knows no more than 
Sleeping Beauty provided Sleeping Beauty ascribes probability 1 /2 to heads when she wakes 
up: 8 

• Before the experiment starts, Sleeping Beauty is offered a bet that pays off $30 if the coin 
lands tails and otherwise, and costs $15. Since heads and tails are viewed as equally 
likely before the experiment starts, this is a fair bet from her point of view. 

• Each time Sleeping Beauty is woken up, she is offered a bet that pays off $20 if the coin 
lands heads and otherwise, and costs $10. Again, if Sleeping Beauty views heads and 
tails as equally likely when she is woken up, this bet is fair from her point of view. 

Note that, if the coin lands heads, Sleeping Beauty is only waken up once, so she loses $15 on 
the first bet and has a net gain of $10 on the second bet, for an overall loss of $5. On the other 
hand, if the coin lands heads, Sleeping Beauty has a net gain of $15 on the first bet, but the 
second bet is offered twice and she has a loss of $10 each time it is offered. Thus, she again 
has a net loss of $5. 

This Dutch Book argument is essentially dealing with bets that pay off at each time step, 
since if the coin lands tails, Sleeping Beauty loses $10 each time she is woken up. By way of 
contrast, consider the following sequence of bets: 

• Before the experiment starts, Sleeping Beauty is offered a bet that pays off $30 if the 
coin lands heads and otherwise, and costs $15. 

• Each time Sleeping Beauty is woken up, she is offered a bet that pays off $30 if the coin 
lands tails and otherwise, and costs $20, with the understanding that the bet pays off 
only once in each trial. In particular, if the coin in fact lands tails, and Sleeping Beauty 
takes the bet both times she is woken up, she gets the $30 payoff only once (and, of 
course, only has to pay $20 for the bet once). The accounting is done at the end of the 
trial. 

Note that the first bet is fair just as in the first Dutch Book, and the second bet is fair to an agent 
who ascribes probability 2/3 to tails when woken up, even though the payoff only happens 
once if the coin lands tails. Moreover, although the second bet is somewhat nonstandard, there 
is clearly no difficulty deciding when it applies and how to make payoffs. And, again, an agent 
who accepts all these bets will lose $5 no matter what happens. 

4.4 Conditioning and the Reflection Principle 

To what extent is it the case that the agent's probability over time can be viewed as changing 
via conditioning? It turns out that the answer to this question is closely related to the question 

8 The importance of taking the knowledge of the Bookie into account, which is stressed by Hitchcock, is also 
one of the key points in |Halpern and Tuttle 1993| . Indeed, it is argued by Halpern and Tuttle that probability 
does not make sense without taking the knowledge of the adversary (the Bookie in this case) into account. 



17 



of when the Reflection Principle holds, and gives further support to using the HT approach to 
ascribing probabilities in the asynchronous case. 

There is a trivial sense in which updating is never done by conditioning: At the point 
(r, m), agent i puts probability on the space JCi(r, m); at the point (r,m + 1), agent i puts 
probability on the space JCi(r,m + 1). These spaces are either disjoint or identical (since 
the indistinguishability relation that determines /Q(r, m) and /Q(r, m + 1) is an equivalence 
relation). Certainly, if they are disjoint, agent i cannot be updating by conditioning, since the 
conditional probability space is identical to the original probability space. And if the spaces 
are identical, it is easy to see that the agent is not doing any updating at all; her probabilities 
do not change. 

To focus on the most significant issues, it is best to factor out time by considering only the 
probability ascribed to runs. Technically, this amounts to considering run-based events, that is 
sets U of points with the property that if (r, m) £ U, then (r, m') £ U for all times m' . In other 
words, U contains all the points in a given run or none of them. Intuitively, we can identify U 
with the set of runs that have points in U. To avoid problems of how to assign probability in 
asynchronous systems, I start by considering synchronous systems. Given a set V of points, let 
V~ = {(r, m) : (r, m + 1) £ V}; that is, V~ consists of all the points immediately preceding 
points in V. The following result, whose straightforward proof is left to the reader, shows that 
in synchronous systems where the agents have perfect recall, the agents do essentially update 
by conditioning. The probability that the agent ascribes to an event U at time m + 1 is obtained 
by conditioning the probability he ascribes to U at time m on the set of points immediately 
preceding those he considers possible at time m + 1. 

Theorem 4.1: MHalpern 2003| Let U be a run-based event and let TZ be a synchronous system 
where the agents have perfect recall. Then 

Pr rjm+1)i (?7) = Pr rjTOji (f7 | tCi(r,m+ 1) _ ). 

Theorem 14. II does not hold without assuming perfect recall. For example, suppose that an 
agent tosses a fair coin and observes at time 1 that the outcome is heads. Then at time 2 he 
forgets the outcome (but remembers that the coin was tossed, and knows the time). Thus, at 
time 2, because the outcome is forgotten, the agent ascribes probability 1/2 to each of heads 
and tails. Clearly, her time 2 probabilities are not the result of applying conditioning to her 
time 1 probabilities. 

A more interesting question is whether Theorem 14. 1 1 holds if we assume perfect recall and 
do not assume synchrony. Properly interpreted, it does, as I show below. But, as stated, it 
does not, even with the HT approach to assigning probabilities. The problem is the use of 
/Q(r, m + 1)~ in the statement of the theorem. In an asynchronous system, some of the points 
in JCi(r, m + 1) - may still be in /Q(r, m + 1), since the agent may not be aware of time passing. 
Intuitively, at time (r, m), we want to condition on the set of points in /Q(r, m) that are on runs 
that the agent considers possible at (r, m + 1). But this set is not necessarily /Q(r, m + 1)~. 
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Let Ki(r,m+ l) (r ' m) = {(r',k) e /Q (r,m) : 3m'((r,m + 1) ~j (r',m'))}. Note that 
/Cj(r, m + l)( r ' m ) consists precisely of those points that agent considers possible at (r, m) that 
are on runs that the agent still considers possible at (r, m + 1). In synchronous systems with 
perfect recall, /Q(r, m + l)( r ' m ) = /Q(r, m + 1)~ since, as observed above, if (r, m + 1) ~j 
(r',m + 1) then (r, m) ~j (r',m). In general, however, the two sets are distinct. Using 
/Q(r, m + l)( r,m ) instead of K~ m+l gives an appropriate generalization of Theorem 14. 1[ 

Theorem 4.2: | |Halpern 2003| Let U be a run-based event and let 1Z be a system where the 
agents have perfect recall. Then, 

P*Z+i,i( U ) = P <U U I Ki(r,m+ l) (r ' m) ). 

Thus, in systems with perfect recall, using the HT approach to assigning probabilities, up- 
dating proceeds by conditioning. Note that since the theorem considers only run-based events, 
it holds no matter how the probability among points on a run is distributed. For example, in the 
Sleeping Beauty problem, this result holds even if (r 2 , 1) and (r 2 , 2) are not taken to be equally 
likely. 

The analogue of Theorem 14.21 does not hold in general for the Elga approach. This can 
already be seen in the Sleeping Beauty problem. Consider the system of Figured] At time 
(in either r\ or r 2 ), the event heads (which consists of all the points in r{) is ascribed probability 
1/2. At time 1, it is ascribed probability 1/3. Since K S b{t x , l) (ri,0) = {(r^O), (r 2 ,0)}, we 
have 

1/3 = K: 9 lsB(heads) ? Pr?% B (heads) | ^(r,, = 1/2. 

The last equality captures the intuition that if Sleeping Beauty gets no additional information, 
then her probabilities should not change using conditioning. 

Van Fraassen's 1199511 Reflection Principle is a coherence condition connecting an agent's 
future beliefs and his current beliefs. Note that what an agent believes in the future will depend 
in part on what the agent learns. The Generalized Reflection Principle says that an agent's 
current belief about an event U should lie in the span of of the agent's possible beliefs about 
U at some later time m. That is, if Pr describes the agent's current beliefs, and Pr l5 . . . , Pr fe 
describe the agent's possible beliefs at time m, then for each event U, Pr({7) should lie between 
minj Pr j(U) and max\,- Pvj(U). Savage's 119541 Sure-Thing Principle is essentially a special 
case of the Generalized Reflection Principle. It says that if the probability of A is a no matter 
what is learned at time m, then the probability of A should be a right now. This certainly 
seems like a reasonable criterion. 

Van Fraassen H1995I in fact claims that if an agent changes his opinion by conditioning on 
evidence, that is, if Pr, = Pr(- | E(j, m)) for j = 1, . . . , k, then the Generalized Reflection 
Principle must hold. The intuition is that the pieces of evidence E(l, m), . . . , E(k, m) must 
form a partition of underlying space (in each state, exactly one piece of evidence will be ob- 
tained), so that it becomes a straightforward application of elementary probability theory to 
show that if <x, = Pi(E(j, t)) for j = 1, . . . ,k, then Pr = aiPix + • • • + a^Pr/c. 
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Van Fraassen was assuming that the agent has a fixed set W of possible worlds, and his 
probability on W changed by conditioning on new evidence. Moreover, he was assuming that 
the evidence was a subset of W. In the multiagent systems framework, the agent is not putting 
probability on a fixed set of worlds. Rather, at each time k, he puts probability on the set of 
worlds (i.e., points) that he considers possible at time k. The agent's evidence is an information 
set — a set of points. If we restrict attention to run-based events, we can instead focus on the 
agent's probabilities on runs. That is, we can take W to be the set of runs, and consider how 
the agent's probability on runs changes over time. Unfortunately, agent i's evidence at a point 
(r, m) is not a set of runs, but a set of points, namely /Q(r, m). We can associate with /Q(r, m) 
the set of runs going through the points in /Q(r, m), namely, in the notation of Section EOl 
TZ()Ci(r,m)) 

In the synchronous case, for each time m, the possible information sets at time m cor- 
respond to the possible pieces of evidence that the agent has at time m. These information 
sets form a partition of the time-m points, and induce a partition on runs. In this case, van 
Fraassen's argument is correct. More precisely, if, for simplicity, "now" is taken to be time 0, 
and we consider some future time m > 0, the possible pieces of evidence that agent i could 
get at time m are all sets of the form /Q(r, m), for r E TZ. With this translation of terms, it 
is an immediate consequence of van Fraassen's observation and Theorem 14. II that the Gener- 
alized Reflection Principle holds in synchronous systems with perfect recall. But note that the 
assumption of perfect recall is critical here. Consider an agent that tosses a coin and observes 
that it lands heads at time 0. Thus, at time 0, she assigns probability 1 to the event of that coin 
toss landing heads. But she knows that one year later she will have forgotten the outcome of 
the coin toss, and will assign that event probability 1/2 (even though she will know the time). 
Clearly Reflection does not hold. 

What about the asynchronous case? Here it is not straightforward to even formulate an 
appropriate analogue of the Reflection Principle. The first question to consider is what pieces 
of evidence to consider at time m. While we can consider all the information sets of form 
/Q(r, m), where m is fixed and r ranges over the runs, these sets, as we observed earlier, 
contain points other than time m points. While it is true that either /Q(r, m) is identical to 
/Q(r', m) or disjoint from JCi(r', m), these sets do not induce a partition on the runs. It is quite 
possible that, even though the set of points /Q(r, m) and /Q(r', m) are disjoint, there may be 
a run r" and times m\ and m 2 such that (r",mi) E /Q(r, m) and (r",m 2 ) G JCi(r',m). For 
example, in Figured if the runs from left to right are ri~r 5 , then JCsb^s, 1) = {r±, . . . , r 5 } 
and Ksb{ti^ 1) = {ri,r 2 ,r 3 }. However, under the assumption of perfect recall, it can be 
shown that for any two information sets /Q(r 1; m) and /Q(r 2 , m), either (a) TZ(JC i (r 1 , m)) C 
n{Ki{r 2l m)), (b) n{Ki{r 2l m)) C ^(n.m)), or (c) ^(^(n, m)) n ft(/Q(r 2 , m)) = 0. 
From this it follows that there exist a collection 1Z' of runs such that the sets 7^(/Cj(r', m)) for 
r' G TZ' are disjoint and the union of 7Z(JCi(r', m)) taken over the runs r' G TZ' consists of all 
runs in TZ. Then the same argument as in the synchronous case gives the following result. 

Theorem 4.3c If TZ is a ( synchronous or asynchronous ) system with perfect recall and JCi(ri,m), 
. . . , JCi(r k , m) are the distinct information sets of the form /Q(r', m) for r' G TZ(JCi(r, 0), then 
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there exist cti, 



, «fc such that 



k 

Pr,(- | 72(/Q(r,0))) = E«i Pr i(- I ^(/C 4 (r„m))). 

i=i 

The following corollary is immediate from Theorem 14 .31 given the definition of Pr£[ r \. 

Corollary 4.4 If TZ is a (synchronous or asynchronous) system with perfect recall and JC i (r 1 , m), 
. . . , /Q(rfc, m) are the distinct information sets of the form JCi(r', m) for r' £ 72-(/Cj(r, 0), 
?/zere exz'sf «i, . . . , such that for all TV C 72., 

Pr^,o)(^(r,0)(72')) = £ ^Pr^ m) (/Q(r,, *»))(#))■ 

Corollary 14.41 makes precise the sense in which the Reflection Principle holds for the HT 
approach. Although the notation /Q(r, m) (JZ r ) that converts sets of runs to sets of points makes 
the statement somewhat ugly, it plays an important role in emphasizing what I take to be an 
important distinction, that has largely been ignored. An agent assigns probability to points, 
not runs. At both time and time m we can consider the probability that the agent assigns to 
the points on the runs in TV, but the agent is actually assigning probability to quite different 
(although related) events at time and time m. It is important to note that I am not claiming 
here that ay = Pr(lZ(K,i(rj, m)) in Theorem 14.31 While this holds in the synchronous case, 
it does not hold in general. The reason we cannot expect this to hold in general is that, in the 
synchronous case, the sets 1Z(JCi(rj,m)) are disjoint, so S?=i ^ T (TZ()Ci(rj, m)) = 1. This is 
not in general true in the asynchronous case. I return to this issue shortly. 

The obvious analogue to Corollary 14.41 does not hold for the Elga approach. Indeed, the 
same example that shows conditioning fails in the Sleeping Beauty problem shows that the 
Reflection Principle does not hold. Indeed, this example shows that the sure-thing principle 
fails too. Using the Elga approach, the probability of heads (i.e., the probability of the points 
on the run where the coin lands heads) changes from 1/2 to 1/3 between time and time 1, no 
matter what. 

Arntzenius H2003I gives a number of other examples where he claims the Reflection Princi- 
ple does not hold. In all of these examples, the agent either has imperfect recall or the system is 
asynchronous and the Elga approach is being used to ascribe probabilities. Thus, his observa- 
tion may not seem surprising, given the previous analysis. However, in one case, according to 
my definition, Reflection in fact does not fail. This is due to the fact that I interpret Reflection 
in a slightly different way from Arntzenius. Since this example is of independent interest, I 
now consider it more carefully. 

The example, credited by Arntzenius to John Collins, is the following: A prisoner has in his 
cell two clocks, both of which run perfectly accurately. However, clock A initially reads 6 PM 
and clock B initially reads 7 PM. The prisoner knows that exactly one of the clocks is accurate; 
he believes that with probability 1/2 the accurate clock is clock A and with probability 1/2 it 
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is clock B. The prisoner also knows that a fair coin has been tossed to determine if the lights 
go out at midnight; if it lands heads, they do, and if it lands tails, they stay on. Since the coin 
is fair, the prisoner initially places probability 1/2 on it landing heads. 

There are four runs in the system corresponding to this problem, each of which has proba- 
bility 1/4: 

• ri, where A is the accurate clock and the coin landed heads; 

• r 2 , where A is the accurate clock and the coin landed tails; 

• r 3 , where B is the accurate clock and the coin landed heads; 

• r 4 , where B is the accurate clock and the coin landed tails. 

We can assume that the environment state encodes the true time and the outcome of the coin 
toss, while the prisoner's state encodes the clock readings and whether the light is off or on. 
Thus, a typical global state might have the form ((1 1:30, H), (1 1:30, 12:30, 1)). In this global 
state, the true time is 11:30 and the coin landed heads, clock A reads 11:30 (and is correct), 
clock B reads 12:30, and the light is on (denoted by the component 1 in the tuple). Thus, 
this is the global state at the point (ri, 1 1:30). The other points in the same information set as 
(rx, 11:30) are (r 2 , 11:30) and (r 4 , 12:30). Call this information set Ix. At all the three points 
in fx, the prisoner's local state is (11:30, 12:30,1). For future reference, note that the only other 
information set that includes time 11:30 points is I 2 = {(rx, 10:30), (r 2 , 10:30), (r 3 , 11:30), (r 4 , 11:30)}. 
At all the points in I 2 , the pair of clocks read (10:30, 1 1:30) and the light is on. 

It is easy to check that every information set has at most one point per run. It follows from 
Lemma |3~T1 that at every point, the HT approach and the Elga approach agree. Thus, no matter 
which approach is used, Reflection in the sense of Corollary 14.41 must hold. Observe that the 
prisoner's degree of belief that the coin landed heads in information set I\ is 2/3, while in I 2 
it is 1/2. Thus, the prisoner's initial probability of heads (1/2) is a convex combination of his 
possible probabilities of heads at 1 1:30, but the combination has coefficients and 1. Taking 
the coefficients to be and 1 might seem a little strange. After all, why should we prefer I 2 
so strongly? But I claim that the "strangeness" here is a result of carrying over inappropriate 
intuitions from the synchronous case. In the synchronous case, the coefficients reflect the 
probability of the information sets. This makes sense in the synchronous case, because the 
information sets correspond to possible pieces of evidence that can be obtained at time m, and 
the sum of these probabilities of the pieces of evidence is 1 . However, in the asynchronous case, 
we cannot relate the coefficient to probabilities of obtaining evidence. Indeed, the "evidence" 
in the case of information set I 2 is that the clock readings are (10:30, 11:30) and the light is 
on. This is evidence that the prisoner initially knows that he will certainly obtain at some point 
(although not necessarily at 11:30). Indeed, it falls out of the analysis of Theorem 14.31 that it 
does not make sense to relate the coefficients in the asynchronous case to the probabilities of 
obtaining the evidence. 
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Arntzenius points out another anomaly in this example. Taking P t to denote the prisoner's 
probability at (real) time t, Arntzenius observes that 

Pr7 : oo(clock B is correct | Prn :3 o(clock B is correct) = 1/3) = 0. 

For Pr n : 3o (clock B is correct) = 1/3) holds only in runs r x and r 2 , since at the points (r i? 11 : 
30) and (r 2 , 11 : 30), the prisoner's probability that B is correct is 1/3, while at the points 
(r 3 , 11 : 30) and (r 4 , 11 : 30), the prisoner's probability that B is correct is 1/2. On the other 
hand, B is not correct in runs r\ and r 2 , so the conditional probability is 0. 

Arntzenius suggests that this is a problem, since the prisoner does not trust his later beliefs. 
I would argue that the prisoner should trust all his later beliefs that he is aware of. The trouble 
is, the prisoner has no idea when he has the belief Prn^o (clock B is correct) = 1/3, since he 
has no idea when it is 1 1 :30. (Essentially the same point is made by Schervish, Seidenfeld, and 
Kadane 120041 .) Of course, in a synchronous system, an agent does know when 11:30 is, so 
beliefs of the form Prn: 30 (?7) are ones he should trust. 

Note that if we modify the problem very slightly so that (a) clock A gives the true time, 
(b) the lights will be turned off when the jailer's clock reads midnight, and (c) one of A and B 
gives the jailer's time, but the prisoner does not know which and ascribes each probability 1/2, 
then we get a synchronous system which is identical to Arntzenius's in all essential details. 
However, now Reflection is completely unproblematic. At 11:30, if the light is still on, the 
prisoner ascribes probability 1/3 to heads; if the light is off, the prisoner ascribes probability 
1 to heads. Initially, the prisoner ascribes probability 3/4 to the light being on at 11:30 and 
probability 1 /4 to the light being off. Sure enough 1/2 = 3/4x1/3 + 1/4x1. 

This example emphasizes how strongly our intuitions are based on the synchronous case, 
and how our intuitions can lead us astray in the presence of asynchrony. The prisoner has 
perfect recall in this system, so the only issue here is synchrony vs. asynchrony. 

5 Conclusion 

In this paper, I have tried to take a close look at the problem of updating in the presence of 
asynchrony and imperfect recall. Let me summarize what I take to be the main points of this 
paper: 

• It is important to have a good formal model that incorporates uncertainty, imperfect 
recall, and asynchrony in which probabilistic arguments can be examined. While the 
model I have presented here is certainly not the only one that can be used, it does have a 
number of attractive features. As I have shown elsewhere [ |Halpern 1997[ , it can also be 
used to deal with other problems involved with imperfect recall raised by Piccione and 
Rubinstein [fTWTl. 

• Whereas there seems to be only one reasonable approach to assigning (and hence updat- 
ing) probabilities in the synchronous case, there are at least two such approaches in the 
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asynchronous case. Both approaches can be supported using a frequency interpretation 
and a betting interpretation. However, only the HT approach supports the Reflection 
Principle in general. In particular, the two approaches lead to the two different answers 
in the Sleeping Beauty problem. 

• We cannot necessarily identify the probability conditional on U with what the probability 
would be upon learning U. This identification is being made in Elga's argument; the 
structure 1Z 2 shows that they may be distinct. 

One fact that seems obvious in light of all this discussion is that our intuitions regarding how 
to do updating in asynchronous systems are rather poor. This is clearly a topic that deserves 
further investigation. 
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